Spoken control for user construction of complex behaviors

ABSTRACT

A device interface system is presented. Contemplated device interfaces allow for construction of complex device behaviors by aggregating device functions. The behaviors are triggered based on conditions derived from environmental data about the device.

This application claims the benefit of priority to U.S. provisional application having Ser. No. 61/604,873, filed Feb. 29, 2012.

FIELD OF THE INVENTION

The field of the invention is device interface technologies.

BACKGROUND

As mobile computing technology becomes ever more present in our daily lives, mobile device users become more and more reliant on the functionality provided by their mobile devices. Ideally, mobile devices, or other computing devices, should allow users to specify complex behaviors. Allowing users to specify complex behaviors gives rise to customization of device actions with respect to different situations or conditions to better serve the user's needs.

An online article called “Automate your Motorola Razr with Smart Actions” by Jack Wallen, published Jan. 17, 2012 on TechRepublic.com (see URL www.techrepublic.com/blog/smartphones/automate-your-motorola-razr-with-smart-actions/4215) describes the state of a configurable smart phone. The application on the device allows users to specify a number of complex behaviors by creating rules which may utilize the many basic functions of the device. For example, a user can create a rule that causes the device to be silenced based on knowing a particular location (say home) and a particular time of day (say 6:00-7:00 pm). This application supports only touch-based controls and does not support technologies capable of making even simple rule creation totally effortless and intuitive, such as speech recognition or natural language processing. The technology cannot convert spoken input signals to create rules governing complex behaviors for the device to perform.

Others have put forth effort toward offering user control of device behavior. One example includes U.S. patent application publication 2011/0254792 to Waters et al. titled “User Interface to Provide Enhanced Control of an Application Program”, filed as an international application on Dec. 18, 2009. Waters merely focuses on providing support for touch-based controls rather than a natural multi-modal dialog interaction. Still another example includes U.S. Pat. No. 7,154,862 to Krzyzanowski titled “Device Control System, Method, and Apparatus for Server-Based or Peer-to-Peer Network Environments”, filed Feb. 23, 2004. Krzyzanowski seeks to provide a unified interface via handheld devices. Still further, U.S. Pat. No. 7,302,394 to Baray et al. titled “Front-End Device Independence for Natural Interaction Platform”, filed Dec. 20, 2002, contemplates providing a natural language interpretation system that provides commands to back-end applications.

U.S. Pat. No. 8,346,563 B1 to David Hjelm, Robert Krüger, Björn Gülsdorff, Ebba Birgitta Gustavii, and Maria Cheadle titled “System and methods for delivering advanced natural language interaction applications”, issued Jan. 1, 2013, describes a system for delivering advanced natural language interaction applications, comprising a dialog interface module, a natural language interaction engine, a solution data repository component comprising at least one domain model, at least one language model, and a plurality of flow elements and rules for managing interactions with users, and an interface software module. Users submit requests via a network, which are preprocessed by the dialog interface module and subsequently transmitted to the natural language interaction engine. Using language recognition rules stored in the solution data repository and the determined semantic meaning or user intent, the natural language interaction engine forms and delivers an appropriate response or takes an appropriate action based on the request. Hjelm et al. describe a generic approach to human-computer conversational dialog management. The work fails to distinguish a system response in the sense of “executing an action” from a system response that entails complex device behavior. Hjelm et al. fail to address the creation or use of triggers in the creation of complex device behaviors. Additionally, the work does not discuss the creation of complex device behaviors from one or more function primitives.

U.S. Pat. No. 7,751,884 B2 to David Ternes, Dong M. Birkholz, David W. Yost, and James A. Esler titled “Flexible Neural Stimulation Engine”, issued Jul. 6, 2010, regards an implantable medical pacemaker device that monitors device state and the previous behavior of the device relative to state. Input to the device affects device state which, in turn, can cause the device to take action based upon the change in device state and neural event timers. Ternes et al. make some progress regarding the monitoring of device state and taking an action as a consequence of the state of the device. Ternes et al., however, fail to distinguish a reactive action from a complex behavior. Ternes et al. do not disclose any method for the creation and use of triggers in the creation and performance of complex device behaviors constructed from function primitives. Additionally, the work fails to describe any application of the methods to additional problem domains.

U.S. Patent Publication Number US 2011/0161076 A1 to Bruce L. Davis, Tony F. Rodriguez, William Y. Conwell and Geoffrey B. Rhoads titled “Intuitive Computing Methods and Systems”, published Jun. 30, 2011, describes a smart phone configured to sense audio, imagery or other stimulus data of its user's environment. Based upon such input, the system can act autonomously in accordance with, or fulfillment of, the inferred desires of the user. Davis et al. make progress in methods to determine user intent and in enabling autonomous reaction to user input. Additionally, Davis et al. describe progress in the use of phone sensors to determine device state and subsequently tailor device behaviors or trigger an action. Davis et al., however, fail to address reactions that constitute complex device behaviors. They further fail to describe the creation and use of triggers in the creation and performance of complex device behaviors constructed from function primitives.

U.S. Patent Publication Number US 2009/0112832 A1 to Eser Kandogan, Jie Lu and Michelle Xue Zhou titled “Intelligent Content Assistance”, published Apr. 30, 2009, describes a method for generating one or more context-sensitive content recommendations by detecting the information needs of a user, retrieving one or more content-recommendation templates that match the detected information needs, and instantiating the retrieved templates with one or more parameter values to generate recommended contents. The system generates these recommendations during business processes. It dynamically monitors and detects the needs of a user in a particular context. Additionally, it retrieves content-recommendation templates that appear to match the information needs of the user, instantiating them using information from user input, context, interaction history, system-learned query and content models, and external sources. Context, interaction history and system-learned models are updated dynamically for learning and adaptation. Kandogan et al. make some progress toward using environmental, situational or contextual factors in the detection of the information needs of a user. The work further advances the retrieval of content-recommendation templates that appear to match the information needs of the user. Kandogan et al. additionally make progress with regard to the use of context, interaction history, system-learned models and content models in the determination of system responses. Kandogan et al., however, lack insight with respect to any complex device behavior. Kandogan et al. further fail to describe the creation and use of triggers in the creation and performance of complex device behaviors constructed from function primitives.

U.S. Pat. No. 8,332,224 B2 to Philippe Di Cristo, Chris Weider and Robert A. Kennewick titled “System and Method of Supporting Adaptive Misrecognition Conversational Speech”, issued Dec. 11, 2012, describes a system and methods that support multimodal input including natural language speech input. The system interprets spoken utterances or other received non-spoken inputs using a conversational interface that includes a conversational speech analyzer, a general cognitive model, an environmental model, and a personalized cognitive model to determine context, domain knowledge, and invoke prior information. The system creates, stores and uses extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech or non-speech communication and presenting the expected results for a particular question or command. Di Cristo et al. exhibit progress regarding the use of a personalized cognitive model in determining the context of speech and non-speech communication. The Di Cristo et al. system, however, does not address sensor data or the functional states of a device. Di Cristo et al. also fail to address the conversion of signals to complex device behavior. Di Cristo et al. further fail to address the creation and use of triggers in the creation and performance of complex device behaviors constructed from function primitives.

U.S. Patent Publication Number US 2013/0031476 A1 to Emmett Coin, Deborah Dahl and Richard Mandelbaum titled “Voice Activated Virtual Assistant”, published Jan. 31, 2013, describes a multimodal conversational interface that interprets spoken natural language utterances, visual cues, images and/or other forms of input. The system includes a conversation manager that relies on a set of functions defining very flexible adaptive scripts. During interaction with a user, the conversation manager obtains information from the user, refining or defining more accurately what information is required by the user, and asking the user for additional information to determine the user's intent. User responses result in the selection of scripts or subscripts relevant to subsequent actions. The system may obtain additional data relevant to the dialog from local sensors or remotely from other sources during an interaction. Remote sources are accessed by activation of an appropriate function such as a search engine. Coin et al. make progress regarding the interactive determination of user intent, the selection of scripts or subscripts that are relevant to subsequent behaviors, and the use of sensor data. Coin et al., however, fail to provide insight into the creation of complex device behaviors. Coin et al. also fail to address the creation and use of triggers in the creation and performance of complex device behaviors constructed from function primitives.

These references fail to provide insight into how users can be enabled to construct complex device behaviors based on natural interactions with a device environment, especially in a context where previous interactions can affect future device states. Furthermore, the references fail to provide any insight into using primitives to instantiate a future device behavior. There thus remains a need for systems and methods that address both the conversion of signals to complex device behavior and creating and using triggers for complex device behavior from function primitives.

These and all other extrinsic materials discussed herein are incorporated by reference in their entirety. Where a definition or use of a term in an incorporated reference is inconsistent or contrary to the definition of that term provided herein, the definition of that term provided herein applies and the definition of that term in the reference does not apply.

Unless the context dictates the contrary, all ranges set forth herein should be interpreted as being inclusive of their endpoints, and open-ended ranges should be interpreted to include commercially practical values. Similarly, all lists of values should be considered as inclusive of intermediate values unless the context indicates the contrary.

Thus, with the difficulty of using graphical interfaces, particularly on small devices with limited display area, the increased need for effective assistant behaviors on computing devices, and the greater power and flexibility of spoken interfaces, there is still a need for a spoken method for user creation of complex behaviors.

SUMMARY OF THE INVENTION

The inventive subject matter provides apparatus, systems and methods that allow a user to interact with a computing device, possibly a cell phone, by speaking, and to create complex behaviors for that computing device that provide more effective, assistant-like behavior tailored to that user's needs. One aspect of the inventive subject matter includes a device interface, preferably a mobile device interface, which allows a user to construct complex device behaviors based on historical interactions. Contemplated interfaces include a dialog interface module that can capture a digital representation of a spoken utterance from a user. Preferably the digital representation includes a signal having one or more derivable characteristics (e.g., tone, words, tokens, frequency, modulation, envelope, touch, temperature, gestures, images, physical location, speed, etc.). The device interface can also include one or more connections to data sources through which the device interface can obtain information related to the device's environment. Preferably, the device interface also includes an interaction history database storing data objects representing past interactions between the user and the device, possibly based on multiple modalities or previously executed device functions. Still further, the device interface can comprise a triggering module coupled with the various components of the device interface. The module can be configured to generate a trigger that causes the device to take one or more actions giving rise to complex behaviors. For example, the triggering module can convert the signal, possibly based on characteristics derived from the signal, to one or more device behaviors. The triggering module can further create a trigger based on the device behaviors and a device state derived from the data sources. Once the trigger conditions have been met, the device can be configured to execute the device behaviors.

Various objects, features, aspects and advantages of the inventive subject matter will become more apparent from the following detailed description of preferred embodiments, along with the accompanying drawing figures in which like numerals represent like components.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 provides an overview of an ecosystem where a future device state can be triggered based on observations of interactions.

FIG. 2 illustrates construction of a query to the interaction history database based on environment data attributes.

FIG. 3 illustrates an example of a device state.

FIG. 4 illustrates how a device state maps to device function primitives.

FIG. 5 illustrates the construction of future device behaviors.

FIG. 6 illustrates a possible trigger that can cause a device to exhibit a future behavior.

FIG. 7 shows a method of triggering a future device behavior.

FIG. 8 shows a schematic of the spoken dialog interface interacting with the user to create complex behavior involving control of the ring, triggered on calendar information.

FIG. 9 shows a schematic of the spoken dialog interface interacting with the user to create complex behavior involving control of an external device, triggered on calendar and location information.

DETAILED DESCRIPTION

It should be noted that while the following description is drawn to computer/server-based device interaction systems, various alternative configurations are also deemed suitable and may employ various computing devices including servers, interfaces, systems, databases, agents, peers, engines, controllers, or other types of computing devices operating individually or collectively. One should appreciate that the computing devices comprise a processor configured to execute software instructions stored on a tangible, non-transitory computer readable storage medium (e.g., hard drive, solid state drive, RAM, flash, ROM, etc.). The software instructions preferably configure the computing device to provide the roles, responsibilities, or other functionality as discussed below with respect to the disclosed apparatus. In especially preferred embodiments, the various servers, systems, databases, or interfaces exchange data using standardized protocols or algorithms, possibly based on HTTP, HTTPS, AES, public-private key exchanges, web service APIs, known financial transaction protocols, or other electronic information exchanging methods. Data exchanges preferably are conducted over a packet-switched network, the Internet, LAN, WAN, VPN, or other type of packet-switched network.

One should appreciate that the disclosed techniques provide many advantageous technical effects including providing an infrastructure capable of generating one or more signals that configure a mobile device to present possible interactions for a user that might be of interest.

The following discussion provides many example embodiments of the inventive subject matter. Although each embodiment represents a single combination of inventive elements, the inventive subject matter is considered to include all possible combinations of the disclosed elements. Thus if one embodiment comprises elements A, B, and C, and a second embodiment comprises elements B and D, then the inventive subject matter is also considered to include other remaining combinations of A, B, C, or D, even if not explicitly disclosed.

As used herein, and unless the context dictates otherwise, the term “coupled to” is intended to include both direct coupling (in which two elements that are coupled to each other contact each other) and indirect coupling (in which at least one additional element is located between the two elements). Therefore, the terms “coupled to” and “coupled with” are used synonymously. Further, “coupled to” and “coupled with” are construed to mean “communicatively coupled with” in a networking context.

The inventive subject matter is considered to include a spoken dialog device interface architecture with models to support interactions for the creation of complex device behaviors, support for execution of the complex behaviors, and coupling with a variety of databases, sensor data, or other information data sources associated with the device, or with applications on the device or accessible through the device. Interactions to create behaviors could include simple commands or more complex dialog interactions, possibly multi-modal interactions, with the device in order to specify the desired behaviors. One should further appreciate that a dialog can comprise representations of communication modalities beyond spoken utterances as well. For example, an input signal can be analyzed to derive characteristics of spoken utterances (e.g., words, tokens, syntax, modulation, frequency, tones, accents, etc.), haptic data, tactile data, temperature data, motion data, location data, or other types of sensor data that can be associated with a dialog interaction.

FIG. 1 presents an overview of an ecosystem that can be located on a computing device 130, preferably a mobile device functioning as a virtual assistant. The mobile device 130 preferably includes a smart phone, or could include a tablet, a computer, an appliance, a consumer electronic device, a vehicle, or other types of devices. The device 130 processes spoken input represented by spoken utterances 120 from a user 110 and allows the user 110 to create, through spoken dialog interaction with the device 130, a customized, complex future behavior 170 for the device 130.

Utterances 120 can be converted to signals, which in turn can be analyzed to derive signal characteristics reflecting the properties of the utterances. For example, the signal characteristics (e.g., waveforms, envelopes, frequencies, tone, etc.) can be mapped to one or more words, concepts, or other types of objects. One aspect of the inventive subject matter includes a dialog interface that interprets user utterances 120 in the context of possible functions, behaviors, or trigger conditions for the device 130. This is achieved by processing the incoming spoken utterance 120 with the dialog interface 133. The interpretation comprises a set of key-value pairs and an identifier of the concept related to the request. This interpretation is sent on to the triggering module 150. For example, an utterance such as ‘please remind me to call my mother when I get home’ can be mapped to ‘action=reminder, function=phone call, phone number=mother, location=home’. Additionally, inputs from other modalities such as touch, text, gesture, etc. are likewise mapped to key-value pairs via the dialog interface 133.
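
By way of a non-limiting illustration only, the following Python sketch shows one possible shape for such an interpretation. The names (Interpretation, interpret_utterance, the slot vocabulary) are hypothetical and are not drawn from the disclosed system; a real dialog interface 133 would employ speech recognition and natural language processing rather than the hard-coded pattern shown here.

    from dataclasses import dataclass, field

    @dataclass
    class Interpretation:
        concept: str                                # identifier of the concept related to the request
        slots: dict = field(default_factory=dict)   # key-value pairs derived from the signal

    def interpret_utterance(text: str) -> Interpretation:
        # Hard-coded stand-in for real speech recognition and NLP.
        if "remind me to call my mother" in text and "home" in text:
            return Interpretation(
                concept="reminder",
                slots={"action": "reminder", "function": "phone call",
                       "phone number": "mother", "location": "home"})
        return Interpretation(concept="unknown")

    print(interpret_utterance("please remind me to call my mother when I get home"))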

Triggering module 150 finds one or more possible matches between the key-value pairs and primitive device functions stored in the device function database 140. The set of possible primitive functions depends on the capabilities of a particular device 130 and is stored in the device function database 140. The triggering module 150 checks whether the incoming utterance 120 has sufficient information to create a new future behavior. A future behavior 170 can be created when there is at least one action, one primitive function (e.g., location) and a trigger condition available. Generally, the complex behavior may have one or more actions, and zero or more conditional triggers for those actions. If there is not sufficient information, the triggering module 150 will send this information to the dialog interface 133, which in turn will construct a question to the user 110 regarding the missing information.
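
A minimal sketch of that sufficiency check, assuming a hypothetical slot vocabulary, might look as follows; the role names and helper function are illustrative only, not the disclosed implementation.

    REQUIRED_ROLES = ("action", "primitive function", "trigger condition")

    def missing_roles(slots: dict) -> list:
        """Return the roles still needed before a future behavior can be created."""
        present = set()
        if "action" in slots:
            present.add("action")
        if any(k in slots for k in ("function", "location")):   # a primitive function
            present.add("primitive function")
        if any(k in slots for k in ("location", "time")):       # a trigger condition
            present.add("trigger condition")
        return [role for role in REQUIRED_ROLES if role not in present]

    # An empty result means a future behavior can be created; otherwise the
    # missing roles are handed back to the dialog interface to ask the user about.
    print(missing_roles({"action": "reminder", "function": "phone call",
                         "location": "home"}))   # -> []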

A possible technique to determine if sufficient information is available is for the triggering module 150 to create a query 155, based on the current set of environment data 135, to the interaction history database 160 to retrieve previous interactions. This is described in more detail in FIG. 2.

In FIG. 2, the triggering module 200 first requests the current set of environment data attributes 220. Based on both the current key-value pairs of the user input and the retrieved environment data attributes 220, the triggering module assembles a query 230 to the interaction history database 250 in order to attempt to resolve missing pieces of information that are required to create the requested future behavior, such as the phone number or location in the example above. The interaction history database 250 passes on the query 230 to the user preferences database 253 and to the future device behavior database 255. The combined results 260 from all three databases are then returned to the triggering module 200, which in turn evaluates the results and updates the key-value sets accordingly. A different variation of the query creation entails using the signature of the current device state in order to find similar previous interactions with similar device state signatures.
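
The query fan-out of FIG. 2 can be pictured with the following sketch; the database objects and their lookup(query) method are assumed interfaces for illustration, not the actual implementation.

    def resolve_missing(slots: dict, env_attributes: dict, databases: list) -> dict:
        """Assemble a query from user key-value pairs plus environment data
        attributes, then merge the results returned by the interaction history,
        user preferences, and future device behavior databases."""
        query = {**env_attributes, **slots}      # current input plus environment
        resolved = {}
        for db in databases:                     # each db exposes lookup(query) -> dict
            resolved.update(db.lookup(query))
        resolved.update(slots)                   # user-provided values take precedence
        return resolved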

Referring back to FIG. 1, contemplated systems 100 would access information related to one or more users 110, such as the interaction history database 160 storing information about previous user interactions, databases of calendar events, contact lists, user preferences, or any other databases of relevant information available from the device 130. In the contemplated systems 100, an Internet search or other interaction with a network site would be both a possible action as part of a behavior and a possible data source of conditional trigger information. In response to the user utterances 120 and descriptions of complex behavior, the system would represent the sequence of simple actions with associated conditions or triggers and send messages to the appropriate applications or device functions when the execution conditions for the complex behavior were met.

The interaction history database 160 stores information about prior interactions and is used to support the natural flow of the current dialog interaction or to supply information about past user behavior, past device behavior, or past device functions that could be relevant to complex behaviors or trigger conditions. Additionally, the interaction history database 160 also includes information about the user's preferences. This information is also utilized to resolve ambiguities or unknown values. In the example above, ‘please remind me to call my mother when I get home’, the values for “phone number=mother” and “location=home” could be resolved via the user preferences to the mother's phone number and the GPS location values for the home address of the user.
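
One way to picture that resolution step is the sketch below; the preference entries and values are purely hypothetical placeholders, not actual data from the disclosed system.

    # Hypothetical user preference entries; real values would come from the
    # user preferences portion of the interaction history database 160.
    USER_PREFERENCES = {
        ("phone number", "mother"): "+1-555-0123",
        ("location", "home"): (37.4219, -122.0840),   # GPS coordinates
    }

    def resolve_slots(slots: dict) -> dict:
        """Replace symbolic values (e.g., 'mother', 'home') with concrete ones."""
        return {k: USER_PREFERENCES.get((k, v), v) for k, v in slots.items()}

    print(resolve_slots({"action": "reminder", "phone number": "mother",
                         "location": "home"}))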

Additionally, the interaction history database 160 can store information related to past interactions with the device 130, the environment, the user 110, or other factors. For example, historical interactions could include one or more previously constructed device behaviors comprising one or more device functions, which can be used as an example for future device behaviors. The parameters governing the actions taken by a device 130 for a previous device behavior can be altered to conform with the needs of a current or future device behavior.

The environment data 135 is made available via the data source interface 137. In addition to local environment data 135 inherent to the device 130, the data source interface 137 makes a variety of information sources available through communication, for example with the World Wide Web. The data sources can be located anywhere as long as they can be communicatively coupled to the data source interface 137. For example, a data source could include a data source internal to the device 130 itself: one or more applications, or the device operating system. Further, the data sources can be external to the device, possibly accessible over a network data source connection (e.g., LAN, WAN, Wi-Fi, the Internet, cell network, GPS, accelerometer, external sensors, etc.). Example external data sources can include various network sites: web sites, news sites, radio or television stations, etc.

In addition to accessing the environment and interaction history data, the triggering module 150 also accesses the current device state 145 and its associated values for the set of primitive device functions. The combined set of values for all the primitive device functions can be seen as a state signature. Moreover, the device state has a temporal aspect: there always exists a past, current and future device state. An illustration of this can be seen in FIG. 3, which presents a possible example of a device state. The entirety of the device state 350 includes the sum of functional device states 300 such as the current local time, location, and device settings such as ringer mode, ringer volume, or other attributes. Next, there is the device hardware state 310, which comprises the current values for functions such as battery level, powered status, etc. The device state also tracks past device states 320 and projects future device states 330. The device state is updated at frequent and regular time intervals. For example, the device state may be updated every minute and each time there are changes in the status of the primitive functions, i.e., each time the device's location changes by more than x feet, etc.
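
A device state of the kind shown in FIG. 3 can be sketched as a small data structure; the field names below are assumptions made for illustration, not the actual state representation.

    from dataclasses import dataclass, field

    @dataclass
    class DeviceState:
        functional: dict = field(default_factory=dict)  # local time, location, ringer mode/volume, ...
        hardware: dict = field(default_factory=dict)    # battery level, powered status, ...

        def signature(self) -> tuple:
            """The combined set of values for all primitive device functions."""
            return tuple(sorted({**self.functional, **self.hardware}.items()))

    state = DeviceState(
        functional={"local_time": "17:30", "location": "home", "ringer_mode": "vibrate"},
        hardware={"battery_level": 0.82, "powered": False})
    print(state.signature())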

Again referring back to FIG. 1, in some embodiments, the contemplated system can comprise a device function database 140 (e.g., a listing of functions that are device independent, functions that are device-specific, device primitives, etc.). The database 140 could be indexed by the characteristics of the input signal (e.g., utterance 120) via the triggering module 150. If the signal includes a word that maps to “reservation”, or a “reservation concept”, then the “reservation” concept can be used as a query to obtain related device functions for making a reservation (e.g., returning pointers to a calendar application, returning I/O options for seeking additional user input, etc.). The returned functions can then be aggregated to form a corresponding complex behavior: making a reservation by (a) querying for additional input from a user 110, (b) interfacing to a calendaring program, and (c) seeking previous interactions indicating preferences for the type of reservation, or other device actions. The device behaviors can also be constructed based on known or previously executed device functions, possibly obtained from the interaction history database.
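
For illustration, the concept-indexed lookup might resemble the following sketch; the concept names and function identifiers are hypothetical stand-ins for the contents of the device function database 140.

    # A toy stand-in for the device function database 140, indexed by concept.
    DEVICE_FUNCTION_DB = {
        "reservation": ["open_calendar_application", "prompt_user_for_details",
                        "lookup_reservation_preferences"],
        "reminder": ["schedule_alert", "place_phone_call"],
    }

    def functions_for(concept: str) -> list:
        """Return the primitive device functions related to a concept."""
        return DEVICE_FUNCTION_DB.get(concept, [])

    # Aggregating the returned functions yields the corresponding complex behavior.
    print(functions_for("reservation"))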

Moreover, any device function or device state 145 of the device, and any information accessible through the device 130, would be available to the speech-based behavior creation as a possible action to be taken or as a condition or trigger, including email, location information, calendar functions, reminders, turning the device on and off, the state of the device's battery, prior interactions, user preferences, volume settings, entertainment functions such as playing music or viewing video, internet search, camera functions, social networking functions, or use of applications on the device.

FIG. 4 shows how this device state is updated from two main information sources. The set of primitive functions 410 sends an update 420 to the device state 400 each time there is a change in one of the values of the primitive functions. Additionally, each time the device state is queried by the triggering module, the device state in turn queries the environment data 430 for updates 435.

FIG. 5 illustrates a method 500 for the construction of a future behavior. First, at step 505, the triggering module evaluates the user-provided key-value pairs or attributes, and at step 510 determines if one of the incoming key-value pairs matches a primitive function request. Each available primitive function of a system always has one or more primitive function requests associated with it. Additionally, primitive function requests can have one or more attributes. For example, to turn on the heat at home, one associated attribute might be the turn-on time. Then, the triggering module searches the database of existing future behaviors with similar request types for other possible attributes or triggers at step 515. Following this, the triggering module looks up the related environment data and interaction history as discussed above to fill as many attributes internally as possible at step 520. Next, the triggering module determines if enough information is available to create a future behavior, as illustrated by conditional step 525. If yes, the triggering module assembles a complete set of triggers and actions, and a new future behavior is created where both the triggers and actions include a combination of values for the involved primitive functions at step 550. Note that a future behavior can also include a combination of several primitive function requests if that is what the user specified. For example, “every time I come home, turn on the heat and the kitchen light”. The combination can be of any type, ranging from a weighted sum to a filtered or ranked combination based on previous use in prior similar states. The device is then configured to exhibit this new future behavior once its trigger is fulfilled, and the behavior is stored in the future behavior database, as indicated by step 555. In summary, the triggering module can construct a complex behavior by aggregating device functions from the primitives together, where the device behavior seeks to alter the device state (e.g., a future state, future configuration, current state, current configuration, etc.).
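
Steps 550 and 555 can be pictured with the sketch below, in which a resolved set of key-value pairs is split into a trigger part and an action part; the trigger vocabulary and class layout are assumptions made for illustration only.

    from dataclasses import dataclass

    @dataclass
    class FutureBehavior:
        triggers: dict                 # primitive-function values that must hold
        actions: list                  # primitive function requests to execute
        priority: int = 0

    TRIGGER_KEYS = {"location", "time"}   # assumed trigger vocabulary

    def build_future_behavior(slots: dict) -> FutureBehavior:
        """Split resolved key-value pairs into triggers and actions (step 550)."""
        triggers = {k: v for k, v in slots.items() if k in TRIGGER_KEYS}
        actions = [v for k, v in slots.items() if k not in TRIGGER_KEYS]
        return FutureBehavior(triggers=triggers, actions=actions)

    # "every time I come home, turn on the heat and the kitchen light"
    print(build_future_behavior({"location": "home",
                                 "action 1": "turn on heat",
                                 "action 2": "turn on kitchen light"}))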

If the triggering module at step 525 lacks complete information, then at step 530 the triggering module can inform the dialog interface about the missing information. In response, at step 535 the dialog interface can construct a query for the missing information, which can then be presented to the user. Step 540 can include receiving a user response to a question based on the query, where the user response can be fed back into the method at step 505.

Again referring back to FIG. 1, the device function database 140 could also be indexed by the device state so that only valid functions with respect to the device state are returned. One should appreciate that a device state can include a previous device state, a current device state, or even a future device state, where the device state can be derived from one or more data sources. For example, contemplated device interfaces can include one or more data source connections that allow the device to acquire environment data representative of the device's environment from the data sources. The environment data 135, or even previous interactions, can be used to derive the device state 145 by comparing values or parameters from the environment data to parameters associated with known device states. It should be further appreciated that the device state can also reflect a functional state of the device or even a hardware state of the device.

Device behaviors are not necessarily executed upon creation. In view of the fact that a device behavior can, and most likely will, affect the device's state, the behavior should be executed once a satisfactory triggering condition has been met. A triggering module can combine the information learned thus far and also create a trigger indicating when a device behavior should be exhibited. The trigger can be built based on the desired behavior or the device state. FIG. 6 shows an example trigger data structure 610. This data structure contains two requirements 620 and 630 and one optional condition 640. A requirement is defined as a primary trigger and is thus always evaluated first. In this example, every time the current time is 5:30 pm OR it is the weekend, the future device behavior 650 should be triggered 645. An optional condition is defined as an alternative trigger that can trigger the future device behavior 650 even if the primary trigger does not evaluate to true. In this example, the optional condition covers the case when the user is at home during a weekday and the home temperature drops below 67 degrees.
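
The FIG. 6 trigger can be sketched as follows, with the requirements OR-combined and the optional condition AND-combined internally; the predicate style is an assumption made for illustration, not the disclosed data structure.

    from dataclasses import dataclass, field

    @dataclass
    class Trigger:
        requirements: list                            # primary triggers; evaluated first, OR-combined
        optional: list = field(default_factory=list)  # alternative trigger; AND-combined

        def fires(self, state: dict) -> bool:
            if any(req(state) for req in self.requirements):
                return True
            return bool(self.optional) and all(cond(state) for cond in self.optional)

    # Fire at 5:30 pm OR on the weekend; optionally, fire when the user is home
    # on a weekday and the home temperature drops below 67 degrees.
    trigger = Trigger(
        requirements=[lambda s: s["time"] == "17:30", lambda s: s["weekend"]],
        optional=[lambda s: s["location"] == "home",
                  lambda s: not s["weekend"],
                  lambda s: s["home_temp_f"] < 67])
    print(trigger.fires({"time": "09:00", "weekend": False,
                         "location": "home", "home_temp_f": 65}))  # True, via the optional path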

In some cases, the trigger condition might indicate that the device behavior should be exhibited immediately in order to achieve a desired future device state as soon as possible. In other cases, the desired behavior will only be exhibited once a future device state is achieved. Thus, a trigger is considered to include rules, requirements, optional conditions, or other criteria allowing the trigger to be satisfied according to different factors. The trigger can further include time dependency, where the trigger is satisfied substantially in real-time or where the trigger depends on a future state. In other words, a trigger does not necessarily have to be satisfied at the time of creation.

FIG. 7 illustrates the method 700 of triggering a future device behavior. At step 705, all sensors and other components that contribute to the device status as discussed above are programmed to send a status update to the device state each time there is a change. Since not all components can be programmed to send updates upon the occurrence of a change, the trigger module additionally queries the current device status at regular time intervals, for example every minute, as indicated by step 710. At step 715, if any change has been detected in the device status, the trigger module will query the set of active future device behaviors. If there are one or more of these future device behaviors whose triggers evaluate to true, the trigger module will execute, at step 720, the behaviors in the sequence matching their priorities.
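
A minimal sketch of this polling loop follows, assuming hypothetical device and behavior interfaces; in practice the sensor-driven updates of step 705 would supplement the fixed-interval polling shown here.

    import time

    def trigger_loop(device, active_behaviors, poll_seconds=60):
        """Poll the device state (step 710), detect changes (step 715), and
        execute due behaviors in priority order (step 720).

        `device` is assumed to expose current_state() returning an object with a
        signature() method; each behavior is assumed to expose .trigger.fires(state),
        .priority, and .execute(device). All of these names are illustrative.
        """
        last_signature = None
        while True:
            state = device.current_state()
            if state.signature() != last_signature:
                due = [b for b in active_behaviors if b.trigger.fires(state)]
                for behavior in sorted(due, key=lambda b: b.priority):
                    behavior.execute(device)
                last_signature = state.signature()
            time.sleep(poll_seconds)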

An astute reader will appreciate that the disclosed systems or interfaces give rise to complex behaviors through leveraging past knowledge or future desired device states. The system can determine a desirable device state with respect to its environment, and then seek one or more state paths from the device's current state to the final desirable device state. The paths can be built from a series of states and device functions or behaviors that alter a current state of the device until a goal state is achieved. Further, the trigger for each function or behavior can be managed so that the user's interactions with the device have the greatest chance of success. A device interface can construct a queue of device behaviors, where the queue aids in configuring the device at appropriate times or circumstances to execute corresponding functions when triggering criteria are met.

As a use case, consider a scenario where a user wants to create a complex behavior on a mobile phone (operating as a virtual assistant). Suppose, as shown in FIG. 8, the user says “Don't interrupt me during my presentation today”. The spoken dialog interface for creating complex commands would process and interpret the user's utterance using information from the calendar database and the interaction history database to produce the response “Should I set the ring to vibrate only like we did for last week's presentation?”. When the user responds “Yes”, the system would use calendar information to find the start time of the presentation and store a delayed action to send a message to the device in order to set the ring to vibrate only, starting at that time. The execution component of the system would then send the message at the designated start time to change the ring setting. The system might then ask, based on interaction history and calendar information, “Should I switch back to vibrate and ring when the presentation is over at 3 pm the way we did last week?” Suppose the user responds, “No, leave it on vibrate until the meeting is over”. Using information from the calendar, the system stores a command to send a message to the phone at the meeting end time that will set the ring to the user's usual setting of ring and vibrate. To continue the example, suppose later that day, but before the presentation, the user says “Let me know right away if Maya is trying to contact me.” Using information in the user's contact list to identify Maya, the system sends message(s) to the device to, for example, make the phone ring loudly, vibrate and flash on the screen if an email, text or phone call is received from Maya. Given the interaction history and the stored actions, the system would identify a potential conflict in instructions and ask “Should I notify you during the presentation if Maya calls?” The system coordinates information from various sources to make construction of complex behaviors faster and more successful for the user. For the contemplated systems, the component actions for the complex behaviors or trigger conditions would be limited only by the capabilities of the device and the available data sources.

As another use case scenario, FIG. 9 depicts a user initiating an interaction. The user wishes to establish the complex and conditional behavior of adjusting the heating system thermostat when he gets “home from work”. The system, not possessing the relevant information in the user's calendar data source, must obtain additional information in order to clarify how to perform the complex behavior. The system asks the user what time the user is typically finished with work, to which the user replies, “5:30 pm”. At this point in the interaction, the system still requires two additional pieces of information in order to trigger the complex behavior, neither of which is in the associated databases. The system first asks what temperature the user would like the thermostat to be set to. Finally, the system needs to clarify the meaning of “home” relative to a GPS reading. The system, knowing its own location, asks the user if he is currently “at home”. When the user responds, “Yes”, the system has all the information it needs to create and trigger the complex device behavior.

One should appreciate that a device behavior represents how the device presents itself as a useful device to the user rather than merely a collection of functions. For example, the device behavior not only includes taking actions, but could also include the manner in which it takes action. Thus, device behavior can be considered to include a kind of “device attitude” of sorts. Further, the behavior can take on characteristics that reflect a user's desired manner of interacting with the device. Some users might enjoy engaging with many detailed device features while other users might enjoy engaging with fewer features. Therefore, each device could exhibit different behaviors while accomplishing the same tasks.

It should be apparent to those skilled in the art that many more modifications besides those already described are possible without departing from the inventive concepts herein. The inventive subject matter, therefore, is not to be restricted except in the scope of the appended claims. Moreover, in interpreting both the specification and the claims, all terms should be interpreted in the broadest possible manner consistent with the context. In particular, the terms “comprises” and “comprising” should be interpreted as referring to elements, components, or steps in a non-exclusive manner, indicating that the referenced elements, components, or steps may be present, or utilized, or combined with other elements, components, or steps that are not expressly referenced. Where the specification or claims refer to at least one of something selected from the group consisting of A, B, C . . . and N, the text should be interpreted as requiring only one element from the group, not A plus N, or B plus N, etc.

1-22. (canceled)
 23. A virtual assistant device comprising: a memory; at least one processor; a dialog interface including at least one sensor and configured to: accept a digital representation of a dialog interaction with a user via the at least one sensor; and derive an interpretation of the dialog interaction from signal characteristics of the digital representation; and a behavior triggering module comprising software instructions stored in the memory that configure the at least one processor to: receive the interpretation from the dialog interface; identify a set of primitive device functions from a device function database based on the interpretation; construct a future device behavior from the set of primitive device functions and the interpretation; generate a trigger for the future device behavior as a function of the future device behavior and a device state associated with a target device; and configure the target device to exhibit the future device behavior upon satisfaction of the trigger.
 24. The device of claim 23, wherein the digital representation comprises a representation of a spoken utterance.
 25. The device of claim 24, wherein the digital representation comprises at least one of the following representations from the spoken utterance: a word, a token, a syntax, a modulation, a frequency, a tone, and an accent.
 26. The device of claim 23, wherein the digital representation comprises a representation of at least one of the following input signals: haptic data, tactile data, motion data, gesture data, and location data.
 27. The device of claim 23, wherein the interpretation comprises a set of key-value pairs.
 28. The device of claim 23, wherein the interpretation comprises a concept identifier.
 29. The device of claim 23, wherein the behavior triggering module further configures the processor to: identify missing pieces of information with respect to the future device behavior; and cause the dialog interface to construct a query for the missing pieces of information from the user.
 30. The device of claim 23, wherein the trigger comprises a primary trigger.
 31. The device of claim 23, wherein the trigger comprises an optional condition.
 32. The device of claim 23, wherein the behavior triggering module further configures the processor to query the device state to determine satisfaction of the trigger.
 33. The device of claim 32, wherein the behavior triggering module further configures the processor to query the device state at regular time intervals.
 34. The device of claim 23, wherein the device state comprises a past device state.
 35. The device of claim 23, wherein the device state comprises a future device state.
 36. The device of claim 23, wherein the device state includes at least one of the following attributes: a location, a time, a battery level, a ringer mode, a ringer volume, a device environment attribute, a temperature, and a change in location.
 37. The device of claim 23, wherein the future device behavior is a member of a queue of device behaviors.
 38. Thedevice of claim 23, further comprising a cell phone that includes thememory, the at least one processor, the dialog interface, and thebehavior triggering module.
 39. The device of claim 23, further comprising a server that includes the memory, the at least one processor, the dialog interface, and the behavior triggering module.
 40. The device of claim 23, wherein the future device behavior comprises at least one of the following: a device command, a reservation, a temperature change, and a multimodal interaction.
 41. The device of claim 23, wherein the behavior triggering module further configures the processor to resolve ambiguities of a future device behavior based on an interaction history stored in an interaction history database.
 42. Thedevice of claim 23, wherein the target device comprises the virtualassistant device.